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THE HOMOGENEOUS CHAOS. 


By NorsBert WIENER. 


—— 


Introduction. Physical need for theory. 

Definition. Types of chaos. 

Classical ergodic theorem. Lebesgue form. 

Dominated ergodic theorem. Multidimensional ergodic theorem. 
Metric transitivity. Space and phase averages in a chaos. 

Pure one-dimensional chaos. 

Pure multidimensional chaos. 

Phase averages in a pure chaos, 


SO ee Be ae ee a 


Forms of chaos derivable from a pure chaos. 

10. Chaos theory and spectra. 

11. The discrete chaos. 

12. The weak approximation theorem for the polynomial chaos. 
13. The physical problem. The transformations of a chaos. 


1. Introduction. Physical need for theory. Statistical mechanics 
may be defined as the application of the concepts of Lebesgue integration to 
mechanics. Historically, this is perhaps putting the cart before the horse. 
Statistical mechanics developed through the entire latter half of the nineteenth 
century before the Lebesgue integral was discovered. Nevertheless, it de- 
veloped without an adequate armory of concepts and mathematical technique, 
which is only now in the process of development at the hands of the modern 
school of students of integral theory. 

In the more primitive forms of statistical mechanics, the integration or 
summation was taken over the manifold particles of a single homogeneous 
dynamical system, as in the case of the perfect gas. In its more mature form, 
due to Gibbs, the integration is performed over a parameter of distribution, 
numerical or not, serving to label the constituent systems of a dynamical 
ensemble, evolving under identical laws of force, but differing in their initial 
conditions. Nevertheless, the study of the mode in which this parameter of 
distribution enters into the individual systems of the ensemble does not seem 
to have received much explicit study. The parameter of distribution is essen- 
tially a parameter of integration only. As such, questions of dimensionality 
are indifferent to it, and it may be replaced by a numerical variable « with the 
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range (0,1). Any transformation leaving invariant the probability properties 
of the ensemble as a whole is then represented by a measure-preserving trans- 
formation of the interval (0,1) into itself. 

Among the simplest and most important ensembles of physics are those 
which have a spatially homogeneous character. Among these are the homo- 
geneous gas, the homogeneous liquid, the homogeneous state of turbulence. 
In these, while the individual systems may not be invariant under a change 
of origin, or, in other words, under the translation of space by a vector, the 
ensemble as a whole is invariant, and the individual systems are merely per- 
muted without change of probability. From what we have said, the trans- 
lations of space thus generate an Abelian group of equi-measure transformations 
of the parameter of distribution. 

One-dimensional groups of equi-measure transformations have become 
well known to the mathematicians during the past decade, as they lie at the 
root of Birkhoff’s famous ergodic theorem.’ This theorem asserts that if we 
have given a set S of finite measure, an integrable function f(P) on S, and 
a one-parameter Abelian group 7 of equi-measure transformations of S into 
itself, such that 


(1) PM Pe es Pe (— 0 <A<wo,—- wo cp ow), 


then for all points P on S§ except those of a set of zero measure, and provided 
certain conditions of measurability are satisfied, 


A 
(2) fares f f(TXP) da 
A-~Oo A 0 


will exist. Under certain more stringent conditions, known as metric transi- 
tivity, we shall have 


(3) tim Sf DPA J f(P)dVp 


almost everywhere. The ergodic theorem thus translates averages over an 
infinite range, taken with respect to A, into averages over the set S of finite 
measure. Even without metric transitivity, the ergodic theorem translates 
the distribution theory of A averages into.the theory of S averages. 

In the most familiar applications of the ergodic theorem, S is taken to be 
a spatial set, and the parameter A is identified with the time. The theorem 
thus becomes a way of translating time averages into space averages, in a 








1 Cf. Eberhard Hopf, “ Ergodentheorie,” Ergebnisse der Mathematik und ihrer Grene- 
gebiete, vol. 5. See particularly § 14, where further references to the literature are given. 
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manner which was postulated by Gibbs without rigorous justification, and which 
forms the entire basis of his methods. Strictly speaking, the space averages 
are generally in phase-space rather than in the ordinary geometrical space of 
three dimensions. There is no reason, however, why the parameter » should 
be confined to one taking on values of the time for its arguments, nor even 
why it should be a one-dimensional variable. We are thus driven to formulate 
and prove a multidimensional analogue of the classical Birkhoff theorem. 

In the ordinary Birkhoff theorem, the transformations 7 are taken to 
be one-one point transformations. Now, the ergodic theorem belongs funda- 
mentally to the abstract theory of the Lebesgue integral, and in this theory, 
individual points play no réle. In the study of chaos, individual values of 
the parameter of integration are equally unnatural as an object of study, and 
it becomes desirable to recast the ergodic theorem into a true Lebesgue form. 
This we do in paragraph 2. 

Of all the forms of chaos occurring in physics, there is only one class 
which has been studied with anything approaching completeness. This is the 
class of types of chaos connected with the theory of the Brownian motion. In 
this one-dimensional theory, there is a simple and powerful algorithm of phase 
averages, which the ergodic theorem readily converts into a theory of averages 
over the transformation group. This theory is easily generalized to spaces of 
a higher dimensionality, without any very fundamental alterations. We shall 
show that there is a certain sense in which these types of chaos are central in 
the theory, and allow us to approximate to all types. 

Physical theories of chaos, such as that of turbulence, or of the statistical 
theory of a gas or a liquid, may or may not be theories of equilibrium. In the 
general case, the statistical state of a chaotic system, subject to the laws of 
dynamics, will be a function of the time. The laws of dynamics produce a 
continuous transformation group, in which the chaos remains a chaos, but 
changes its character. This is at least the case in those systems which can 
continue to exist indefinitely in time without some catastrophe which essen- 
tially changes their dynamic character. The study, for example, of the de- 
velopment of a state of turbulence, depends on an existence theory which avoids 
the possibility of such a catastrophe. We shall close this paper by certain very 
general considerations concerning the demands of an existence theory of this 
sort. 


2. Definition. Types of Chaos. A continuous homogeneous chaos in 
n dimensions is a scalar or vector valued measurable function p(%,° °°, 23) 
of @,° °°, 23%, in which z,,- - -,2, assume all real values, while « ranges. 
over (0,1); and in which the set of values of « for which 
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(4) p(X ar Yi," * "5 2n + Yn; %) belongs to Ds 


if it has a measure for any set of values of y,,: - -, yn, has the same measure 
for any other set of values. In this paper, we shall confine our attention to 
scalar chaoses. A continuous homogeneous chaos is said to be metrically 
transitive, if whenever the sets of values of « for which p(a,°° -,2n; 4%) 
belongs to S and to Si, respectively, have measures M and M,, the set of values 
of « for which simultaneously 


p(%,° ° *,%3;%) belongs to S 
and 
p(t + Y1,° °° 5 en + Yn3%) belongs to S, 


has a measure which tends to M, as y? +° * > + Yn? > ©. 
If p is integrable, it determines the additive set-function 


(5) BR a) — fo f p(t + 1 an5 adda: + de 


On the other hand, not every additive set-function may be so defined. This 
suggests a more general definition of a homogeneous chaos, in which the chaos 
is defined to be a function (3; a), where « ranges over (0,1) and & belongs 
to some additively closed set = of measurable sets of points in n-space. We 
suppose that if S and 3, do not overlap, 


(6) O(S+ 315%) = G(S5%) + 5CEr5 @). 

We now define the new point-set 3(41,° °°, Yn) by the assertion 

(7) S(4iy° °°, Yn) contains 7, + y1,°° °,2%.-+ yx when and only when 
Scontaiie’ Gig * 2 ya. 


This leads to the definition of the additive set-function %y,,..., a (ae) by 
(8) Ou ie Se Sige ie un( 5%) = F( 3 (415° ; 5 Yn) 3%). 
If, then, for all classes S of real numbers, 


(9) Measure of set of a’s for which By,...,1,(%35%) belongs to class S 


is independent of ¥1,° °°, Yn, in the sense that if it exists for one set of these 
numbers, it exists for all sets, and has the same value, and if it is measurable 
in Yi,° °°» Yn, We Shall call & a homogeneous chaos. The notion of metrical 
transitivity is generalized in the obvious way, replacing p(a,° °°, @3%) by 
S(S5@), and p(a, + 41,5 ° +, 2u + Yn a) by Su,..., vn (%5 0). 

The theorem which we wish to prove is the following: 
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THEOREM I. Let 3(3;a) be a homogeneous chaos. Let the functional 


(10) {5 (55 %)} = 9() 


be a measurable function of a, such that J, | g(a) log* | g(a)| | da ts finite. 


Then for almost all values of «@, 


1 . 
(11) lim Tey f fF PB ss 2) Fy dy 


TOO 


exists, where R is the wntertor of the sphere 
(12) ys? Eyer eb yy? = 7? 


and V(r) ws its volume. If in addition, &(33;%) is metrically transitive, 


(18) dim Fo fw f OG, (Bia) dy dye =f “O(R(8s 8) ap 


r—>0O0 


for almost all values of a. 


3. Classical ergodic theorem. Lebesgue form. Theorem I is mani- 
festly a theorem of the ergodic type. Let it be noticed, however, that we 
nowhere assume that the transformation of « given by B =7'a when 


(14) us, ....mn( 238) = &(33 @) 


is one-one. This should not be surprising, as the ergodic theorem is funda- 
mentally one concerning the Lebesgue integral, and in the theory of the 
Lebesgue integral, individual points play no réle. 

Nevertheless, in the usual formulation of the ergodic theorem, the ex- 
pression f(7P) enters in an essential way. Can we give this a meaning 
without introducing the individual transform of an individual point? 

The clue to this les in the definition of the Lebesgue integral itself. If 
f(P) is to be integrated over a region S, we divide S into the regions Su,x(f), 
defined by the condition that over such a region, 


(15) gi Py =o. 
We now write 
(16) J P(P)aV» = lim ¥ nem (Serasene(f)). 


The condition that f(P) be integrable thus implies the condition that it be 
measurable, or that all the sets Sa,»(f) be measurable. 
Now, if 7 is a measure-preserving transformation on 9, the sets T’8a(f) 
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will all be measurable, and will have, respectively, the same measures as the 
sets Sao(f). We shall define the function f(7P) = g(P) by the conditions 


(17) DSa,v(f) = Sao(g)- 


If T conserves relations of inclusion of sets, up to sets of zero measure, this 
function will clearly be defined up to a set of values of P of zero measure, 
and we shall have 


(18) f, f(EP)aVe— J f(P)aV, 


We may thus formulate the original or discrete case of the Birkhoft 
ergodic theorem, as follows: Let S be a set of pownts of finite measure. Let T 
be a transformation of all measurable sub-sets of S into measurable sub-sets 
of S, which conserves measure, and the relation between two sets, that one 
contains the other except at most for a set of zero measure. Then except for 
a set of points P of zero measure, 

(19) lim —2—_ 3 f(TP) 
0 


; ; N-oo N + 1 
will exrst. 


The continuous analogue of this theorem needs to be formulated in a 
somewhat more restricted form, owing to the need of providing for the in- 
tegrability of the functions concerned. It reads: Let S be a set of points of 
finite measure. Let T® be a group of transformations fulfilling the conditions 
we have laid down for T in the discrete case just mentioned. Let T*P be 
measurable in the product space of A and of P. Then, except for a set of 
points P of zero measure, 


(20) lim f f(INP) ad 
A->00 A Jo 
will exist. 

In the proofs of Birkhoff’s ergodic theorem, as given by Khintchine and 
Hopf, no actual use is made of the fact that the transformation 7’ is one-one, 
and the proofs extend to our theorem as stated here, without any change. The 
restriction of measurability, or something to take its place, is really necessary 
for the correct formulation of Khintchine’s statement of the ergodic theorem, 
as Rademacher, von Neumann, and others have already pointed out.’ 

With the aid of the proper Lebesgue formulation of the ergodic theorem, 
the one-dimensional case of Theorem I follows at once. Actually more follows, 
as it is only necessary that g belong to L, instead of to the logarithmic class 


2Cf, J. v. Neumann, Annals of Mathematics, 2, vol. 33, p. 589, note II. 
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with which we replace it. To prove Theorem I in its full generality, we must 
establish a multidimensional ergodic theorem. 


4. Dominated ergodic theorem. Multidimensional ergodic theorem. 
As a lemma to the multidimensional ergodic theorem, we first wish to establish 
the fact that if the function f(P) in the ergodic theorem satisfies the condition 


(21) J, |f(@)| los" | F(P)| Ve < &, 


then the expressions (19) and (20) not only exist, but the limits in question 
will be approached under the domination of a summable function of P. We 
shall prove this in the discrete case, for the sake of simplicity, but the result 
goes over without difficulty to the continuous case. 

Let 7 be an equimeasure transformation of the set S of finite measure 
into itself, in the generalized sense of the last paragraph, and let W be a 
measurable sub-set of S, with the characteristic function W(P). Let U be 
the set of all points P for which some T-/P belongs to W. Let i(P), when 
P belongs to W, be defined as the smallest positive number n such that 7-"P 
belongs to W, and let us call it the index of P. Every point of W, except for 
a set of measure 0, will have a finite index, since if we write Woo for the set 
of points without a finite index, no two sets T”™Woo and 7"™Woo can overlap, 
while they all have the same measure and their sum has a finite measure. 
Thus except for a set of zero measure, we may divide W into the sets W5, 
each consisting of the points of W of index p. It is easy to show that if 
l=pcolisprp<cw,0S j<p,0Sj' <p’, the sets TW, and T! Wo 
can not overlap over a set of positive measure unless 7 = 7’, p =p’. Similarly, 
the sets T?W, and 7'-?’W, can not overlap over sets of positive measure, unless 
p = p’, and represent a dissection of W, except for a set of zero measure. Let 
us put Wpq for the logical product of Wp, and T?Wq; Wogr for the logical 
product of Wyqand T’*¢W,; and so on. Then if k is fixed, the sets T-’ Wop, . . . 0 
cover U (except for sets of zero measure) once as 7 goes from 0 to p;, once as 
it goes from p, to p, + po, and so on; making just k times between 0, inclusive, 
and p,+ po-+:- ++ pe, exclusive. Thus if Sx is the set of all the points 
in all the Wo.»....p, for which p, + p2+-°-:+-+ pr—K, the total measure 
of all the Sx’s for 24+* = K = 2% can not exceed km(S) /2%. 

Now let P lie in T-Wops... rp,» WhereO Sj < pit pets: ++ pr Let 


us consider the sequence of numbers aj, where if pi+:**+ piSJ7 
<pit:: ++ pis, aj is the greatest of the numbers 
1 2 
j—Mm—o mm tl? fom mPa tl 
3 eee 


a es 0 a a 
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Then a; will be the largest of the numbers 


PSHE)... WE ba 
os 

The sum > aj;, for a fixed K, will have its maximum oo when pi = pe = 

= 1, = 1: 3 Pe = K + 1—k, when it will be & + > k/j Sk(1+ log K/k). 


In this case the sequence of the a,’s will be 


W(P), 


i aenes kkk ke 
ats ot, 9 a2 + oF? 

boi’? k+2 K 
i: times 


This remark will be an easy consequence of the following fact: let us 
consider the sequence 


ee ae 


ae sas 
—— RT LE] tate 


and the modified sequence 




















we ‘er age 
1 ee M+ —_ 
Sy She Ee 
where of course 
mole > 
+ [FT] “Ey Ele [je 
ee ee oe) 

















it ES] EST + [3] TS 


Apart from the arrangement, the terms of (22) will be the same as the terms 


of (23), except that in (23), replaces A/n. Now, 





A+ 1 
[=| Son 


so that the sum of the terms in (23) is greater than that in (22). 


i 
a 


? 
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Since the transforms of a given set have the same measure as the set, and 
the sets T’/Sx cover U exactly k times, we have 


ae W(P) + max (w(P), UY TE) 


max (w(p), MP) WEP) WP) + WUEP) + WEP) 


foo -pmmax(w (py, ZED WEP), 


W(P) +--+ = | av 
<3 m(sx) (1 +108 F ) 
< m(W) (1+ log * 08 5 ) + 2 ae (1 + low “~) 
=< (: + log =) m(W) + const. ae ms) ps 








the constant being absolute. If we now put 





N= | log ate / log 2 | ; 


this last expression is dominated by 





m(8) 
const. m(W) (: + log mt) y 
Since the constant is independent of h&, we see that the Cesaro average of 


wx (wer), REDE WEP). ., WOR) bb ude) 


is dominated by the same term. Thus 


J. Hie (wiP),: . | MO bE) dV 


<= const. m(W) (1 +- log me) ’ 


and it follows by monotone convergence that there exists a function W*(P) 
such that 





* m (8) 
(24) f. W*(P)dVp < const. m(W) (1+ 10g ot 
and for all positive m, 

(25) W(P) +o + W(E"P) < W*(P). 


m -+- 1 
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Now let f(P) be a function such that 


(21) Jf LF) [Tog | f(P)| dV 2 < 
Let W‘) be the set of points such that 
(26) aN = | f(P)| S 2%, 


and let f‘)(P) be the function equal to f(P) over this set of points, and 0 
elsewhere. Then there exists a function f‘%)*(P), such that 


(27) POE EE) p04 (P) CO, Oye xy, 


and 
(N) < (N) en) N 
(28) ff (P)dVp S const. m(W ) (1 + log m (WO) a 


Hence if 


(29) E mw) (1+ tog =) QV < e 

and 

(30) fe (P) = Dh *(P), 

then 

(31) LO) I) = (P) (i=0, 1,2 <%), 
and 





: S 
(382) f. f*(P)dVp S const. § m(W) (: + log* een) Os 
However, 


(33) > m( WN?) (: + log* mo aN 


- OO 2N+1m, (8) 
=> | f(P)| dVp [ 1 + loge ——_+-_ 
~0O Skt ( J. | f(P)| 7m) 


. f. |f(Q)dVo 


+}, saa Chia ayo } 





{_ |rP)l ave 
78 
+ f_ | F(P)| log* | #(P)| dV. 

8 
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Thus f f*(P)dVp has an upper bound which is less than a function of 
S 


J. | f(P)| dVp and J. | f(P)| log* | f(P)| dVp, tending to 0 as they both 


tend to 0. This establishes our theorem of the existence of a uniform dominant. 
There is a sense in which (21) ts a best possible condition. That is, if 


(34) W(x) = 0(log* x), 
the condition 
(35) f. (Wl F(P)|) | f(P)| dV < 


is not sufficient for the existence of a uniform dominant. For let S be a set 
of measure 1, subdivided into mutually exclusive sets Sx, of measures respec- 
tively 2-". Let Sn be divided into mutually exclusive sets Sni,° °°, Sn,v,, all 
of equal measures. Let T transform Sn,x Into Sn, cui) (k < v"), and Sn,v, into 
Sn. Let f(P) be defined by 


(36) f(P)=a.>00n Sn, (n—1,2,---); f(P) =0 elsewhere. 


Then the smallest possible uniform dominant of 





i N 
(37) Vo & f(P"P) 
is 
(38) ft(P) == on Sra, 
and we have 
(39) f. ft (P)dVp = & MOC) ne. 
S n 
Thus if 
(40) Va = 27" On, On = A(n!), 


the function f*(P) will belong to Z if and only if 


AnQ (log an) 2” 
(41) ee eee 


> const. f. #(P) logt f(P) dVp. 


While we have proved the dominated ergodic theorem merely as a lemma 
for the multidimensional ergodic theorem, the theorem, and more particularly, 
the method by which we have proved it, have very considerable independent 
interest. We may use these methods to deduce von Neumann’s mean ergodic 
theorem from the Birkhoff theorem ; or vice versa, we may deduce the Birkhoff 
theorem, at least in the case of a function satisfying (21), from the von 
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Neumann theorem. These facts however are not relevant to the frame of 
the present paper, and will be published elsewhere. 

We shall now proceed to the proof of the multidimensional ergodic 
theorem, which we shall establish in the two-dimensional case, although the 
method is independent of the number of dimensions. Let 7,\7'.4 be a two- 
dimensional Abelian group of transformations of the set S (of measure 1) 
into itself, in the sense in which we have used this term in paragraph 3. Let 
T,\T.“P be measurable in A, »w, and P. We now introduce a new variable 2, 
ranging over (0,1), and form the product space & of P and z. We introduce 
the one-parameter group of transformations of this space, 7’, by putting 


(42) T'p (P, x) a (T° cos 2ra'T" Pp sin aa xv) 


The expressions 7°(P, x) will be measurable in p and (P, x), and the trans- 
formations 7? will all preserve measure on &. Thus by the ergodic theorem, 
for almost all points (P,x) of 3%, if f(P) =f(P, 72) belongs to L, the hmit 


A A 
(43) lim = f f(T9(P,2))dp—=lim 5 f f(T, 08 827 p sin 22eP) do 
A>0oo A 0 A->0oo A 0 


will exist. If condition (26) is satisfied, it will follow by dominated con- 
vergence that the limit 
1: 2" 2 
(44) ae ae f f (Tp 008 2raP ,p sin 2raP) dy 
Aso A Jo 0 

will exist for almost all points (P, x), and hence for almost all points P. 

For the moment, let us assume that f(P) is non-negative. Then there 
is a Tauberian theorem, due to the author,*? which establishes that the expres- 
sion (44) is equivalent to 


— 2 7A? 





A 21 
f pdp f. f(T cos OT’ .P sin oP) dé. 
0 0 


The only point of importance which we must establish in order to justify this 
Tauberian theorem is that 


1 
1L+ttd 5 <= ——__—_— 
(46) f, p'*"dp pion = 0) 


for real values of u. Since every function f(P) satisfying (21) is the ditf- 
ference of two non-negative functions satisfying this condition, (45) is estab- 
lished in the general case. 

It will be observed that we have established our multidimensional ergodic 


’N. Wiener, “ Tauberian theorems,” Annals of Mathematics, 2, vol. 33, p. 28. 
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theorem on the basis of assumption (21), and not on that of the weaker 
assumption that f(P) belongs to Z. What the actual state of affairs may be, 
we do not know. At any rate, all attempts to arrive at a direct analogue of 
the Khintchine proof for one dimension have broken down. The one-dimen- 
sional proof makes essential use of the fact that the difference of two intervals 
is always an interval, while the difference between two spheres is not always 
a sphere. 

The precise statement of the multidimensional ergodic theorem is the 
following: Let 8 be a set of points of measure 1, and let T\T,**: + - Ty be 
an Abelian group of equimeasure transformations of 8 into ttself, in the sense 
of paragraph 3. Let Ti+: - -T,P be measurable in Ax,° + +,An; P. Let R 
be the set of values of A1,: + +, An for which 


(47) AM? + Akt tA Sr? 


and let V(r) be its volume. Let f(P) satisfy the condition (21). Then for 
almost all values of P, 


Mis & * An bn aaa ae 
(48) lim ao) ii f f(TyM- © TP) dry + * Os 
exists. 


That part of Theorem I which does not concern metric transitivity is an 
immediate corollary. 


5. Metric transitivity. Space and Phase Averages in a Chaos. If the 
function f(P) is positive, clearly 


(49) f- _ frre. - + TP) dd. + dn 


42+ eee nt py? eee — pn 
< f hie f f(T: + TT ys + + TalP) dd + dn 
yet... +An*Sr? 
<= aes f(Tis: - -TnP) drx: - + dn. 
Ayes oo. tAn®Sr2epy?+ . . . + hn? 
Hence 


1 oo e ° ° Ad e ° e An Le e e ° ° e 

oe pa V(r) s R sj f(T: DT" DMP) dary dn 
—_— ; _1i- e e ° Ai e e An ° ° e 

ae, 7a f . f f(T, TP) day Addn, 


and expression (48) has the same value for P and all its transforms under 
the group 7,4: --TZ,)*. The condition of positivity is clearly superfluous. 
Thus in case expression (48) does not almost everywhere assume a single 
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value, there will be two classes 8, and S, of elements of S, each of positive 
measure, and each invariant under all the transformations 7\™4- - - Tn). 

A condition which will manifestly exclude such a contingency is that if 
S, and S, are two sub-sets of 9 of positive measure, and « is a positive quantity, 


there always exists a transformation T =T7,™:- - -T,*, such that 
mS,(TS2) 
(51) ms = ms, < €. 


From this it will immediately follow that if a chaos is metrically transitive in 
the sense of paragraph 3, the group of transformations of the « space generated 
by translations of the chaos will have the property we have just stated, and 
under the assumptions of Theorem I, 


a4 . 1 ° eo ee 
(52) lim Fay f = f {Fy ..., n(35%) dys: > dyn 


T—-0O 
will exist and have the same value for almost all values of a. 
If almost everywhere 


(58) Tima ff MBB aD} dw =A, 


TOM 
and 


(54) oa fof PB Bid dyn < 92) 


where g(a) belongs to LZ, then by dominated convergence, 


(55) A—lim 775 ma 7 ie f dy; ° dyn f da®{ By, ..., mn (33%) } 
ai &{3 (352) }da. 


That is, the average of ®{%(2;a)}, taken over the finite phase space of , is 
almost everywhere the same as the average of ®{%y,,..., y,(%3%)} taken over 
the infinite group space of points y1,: --,Yn. This completes the establish- 
ment of Theorem J, and gives us a real basis for the study of the homogeneous 
chaos.* 


6. Pure one-dimensional chaos. The simplest type of pure chaos is 
that which has already been treated by the author in connection with the 
Brownian motion. However, as we wish to generalize this theory to a multi- 


4The material of this chapter, in the one-dimensional case, has been discussed by 
the author with Professor Eberhard Hopf several years ago, and he wishes to thank 
Professor Hopf for suggestions which have contributed to his present point of view. 
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plicity of dimensions, instead of referring to existing articles on the subject, 
we shall present it in a form which emphasizes its essential independence of 
dimensionality. 

The type of chaos which we shall consider is that in which the expression 
(33%) has a distribution in @ dependent only on the measure of the set 3; 
and in which, if 3, and 3, do not overlap, the distributions of 3 (3h, a) and 
(2%2,%) are independent, in the sense that if (2, y) is a measurable func- 
tion, and either side of the equation has a sense, 


(56) f" Jf, (5 (25 a), Be 8) dads — f" 4(F(%, 2), 8B, a))da 


We assume a similar independence when n non-overlapping sets 31,° °°, Sn 
are concerned. It is by no means intuitively certain that such a type of chaos 
exists, In establishing its existence, we encounter a difficulty belonging to 
many branches of the theory of the Lebesgue integral. The fundamental 
theorem of Lebesgue assures us of the possibility of adding the measures of a 
denumerable assemblage of measurable sets, to get the measure of their sum, 
if they do not overlap. Accordingly, behind any effective realization of the 
theory of Lebesgue integration, there is always a certain denumerable family 
of sets in the background, such that all measurable sets may be approximated 
by denumerable combinations of these. This family is not unique, but without 
the possibility of finding it, there is no Lebesgue theory. 

On the other hand, a theory of measure suitable for the description of a 
chaos must yield the measure of any assemblage of functions arising from a 
given measurable assemblage by a translational change of origin. This set of 
assemblages is essentially non-denumerable. Any attempt to introduce the 
notion of measure in a way which is invariant under translational changes of 
origin, without the introduction of some more restricted set of measurable sets, 
which does not possess this invariance, will fail to establish those essential 
postulates of the Lebesgue integral which deal with denumerable sets of points. 
There is no way of avoiding the introduction of constructional devices which 
seem to restrict the invariance of the theory, although once the theory is 
obtained it may be established in its full invariance. 

Accordingly, we shall start our theory of randomness with a division of 
space, whether of one dimension or of more, into a denumerable assemblage of 
sub-sets. In one dimension, this division may be that into those intervals 
whose coordinates are terminating binary numbers, and in more dimensions, 
into those parallelepipeds with edges parallel to the axes and with terminating 
binary coordinates for the corner points. We then wish to find a self-consistent 
distribution-function for the mass in such a region, dependent only on the 
volume, and independent for non-overlapping regions. 
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This problem does not admit of a unique solution, although the solution 
becomes essentially unique if we adjoin suitable auxiliary conditions. Among 
these conditions, for example, is the hypothesis that the distribution is sym- 
metric, as between positive and negative values, has a finite mean square, and 
that the measure of the set of @’s for which %(S;¢) >A is a continuous 
function of A.> Without going into such considerations, we shall assume 
directly that the measure of the set of instances in which the value of %(S; @) 
in a region S of measure M lies between a and 6b > 4, is 


- 1 v wu? P 
~ yeau J, (— ar) 
The formula 
1 e uw (vy — u) *) 
58 eee os Meets ee ig 
Of V 22MM, S.. =e ( 2M, 2M, . 


1 v? 
 VML+ 4M, exp ( 2(Mi a) 
shows the consistency of this assumption. 

The distributions of mass among the sets of our denumerable assemblage 
may be mapped on the line segment 0 =a=1, in such a way that the 
measure of the set of instances in which a certain contingency holds will go 
into a set of values of a of the same measure. This statement needs a certain 
amount of elucidation. To begin with, the only sets of instances whose 
measures we know are those determined by 


a,= (8134) Sb, 
do = H(S2;3%) S be 


On S (Sn; %) S On; 


(59) 


where 8, S2,: - -,S»n are to be found among our denumerable set of sub- 
divisions of space. However, once we have established a correspondence between 
the measures of these specific sets of contingencies and their corresponding 
sets of values of «, we may use the measure of any measurable set of values 
of a to define the measure of its corresponding set of contingencies. 

The correspondence between sets of contingencies and points on the line 
(0,1) is made by determining a hierarchy of sets of contingencies 


G1, (Mm) <= (S10 5 a) <= b, (mn) 
(60) oe ee ae oe Be 
Ay (mn) = vy (Sym) 5 a) <= by (mn) | 


5 Cf. the recent investigations of Cramér and P. Lévy. 
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Let us call such a contingency Cm». If m is fixed, let all the contingencies 


Cmn (m2 =1,2,- - -) be mutually exclusive, and let them be finite in number. 
Let us be able to write 

N 
(61) Cm,n —e > CO ms1,ny 

iat 


If S’ is one of our denumerable sets of regions of space, let every Om n with a 
sufficiently large index m be included in a class determined by a set of con- 
ditions concerning the mass on S’ alone, and restricting it to a set of values 
lying in an interval (c,d), corresponding to an integral of form (57) and of 
arbitrarily small value. (Here d may be o, or c may be — oo.) Let us put 
C;,, for the entire class of all possible contingencies, and let us represent it 
by the entire interval (0,1). Let us assume that Cm,» has been mapped into 
an interval of length corresponding to its probability, in accordance with (57), 
and let this interval be divided in order of the sequence of their 7’s into 
intervals corresponding respectively to the component Cy41,n,’8, and of the 
same measure. lixcept for a set of points of zero measure, every point of the 
segment (0,1) of a will then be determined uniquely by the sequence of the 
intervals containing it and corresponding to the contingencies Cm,» for suc- 
cessive values of m. This sequence will then determine uniquely (except in a 
set of cases corresponding to a set of values of « of zero measure) the value 
of (Sn;) for every one of our original denumerable set of sets Sn. 

So far, everything that we have said has been independent of dimen- 
sionality. We now proceed to something belonging specifically to the one- 
dimensional case. If the original sets S, are the sets of intervals with binary 
end-points, of such a form that they may be written in the binary scale 


(62) (didz - . dn dies ' ; ‘di, d,° : An Ais * . ‘dy + 2-11) 


where di,- - -, dj are digits which are either 0 or 1, then any interval whatever 
of length not exceeding 2“ and lying in (a,b) (where a and 6 are integers) 
may be written as the sum of not more than two of the (6 — a)2#** intervals 
of form (62) lying in (a,b) and of length 2-7, not more than two of the 
(b —a)2#** intervals of length 2-“?, and so on. The probability that the 
value of | (Sn; )| should exceed A, or in other words, the measure of the 
set of «’s for which it exceeds A, is 


2 ser u? : 
(63) Tew J, exp & as) du = o{exp(— Am(S;,,) a) }. 


Now let us consider the total probability that the value of | 4(Snx:¢)| 
should exceed 2-‘#+1)(#-€) for any one of the (b — a)2#** intervals of length 
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a3, or 2-2) (Fe) for any one of the (b —a)2 intervals of length 2-#, 
or soon. This probability can not exceed 


(64) >> (6 — a)2"** o(exp(— 2(#)€)) == 0 (2% exp(— 2 (ute) ), 


On the other hand, the sum of | §(Sn;)| for all the 2+ 2-+- - - intervals 
must in any other case be equal to or less than 


(65) 9 3 Q-(urk) (4-€) O (2-H) ) , 
k=1 


Thus there is a certain sense in which over a finite interval, and except for a 
set of values of « of arbitrarily small positive measure, the total mass in a 
sub-interval of length = 2~ tends uniformly to 0 with 2. On this basis, 
we may extend the functional %(S;«) to all intervals S. It is already defined 
for all intervals with terminating binary end-points. If (c,d) is any interval 


whatever, let c:,¢2,: - - be a sequence of terminating binary numbers ap- 
proaching c, and let d,,d2,: - - be a similar sequence approaching d. Then 
except for a fixed set of values of « of arbitrarily small measure, 

(66) Him | &( (6m, dn) + &) — (Cem, dm) 5%) | == 0, 


and we may put 


(67) wu ((c, d) 5 @) == lim 3 ( (Cm, din) 3 %). 


Formula (57), and the fact that 3(S:; a) and %(S2; a) vary independently 
for non-overlapping intervals S, and S2, will be left untouched by this extension. 

Thus if % is an interval and 7% a translation through an amount 4, 
we can define %(7*3; a), and it will be equally continuous in A over any finite 
range of A except for a set of values of « of arbitrarily small measure. From 
this it follows at once that it is measurable in A and @ together. Furthermore, 
we shall have 


Measure of set of @’s for which 3(T*3; a) belongs to C = 


ee) Measure of set of as for which %(3;) belongs to C. 


Thus %(%; 4) is a homogeneous chaos. We shall call it the pure chaos. 

If ©{% (3; a)} is a functional dependent on the values of (3; ) for a 
finite number of intervals 3, then if A is so great that none of the intervals 
x» overlaps any translated interval T*3m, ®{3 (3; «)} will have a distribution 
entirely independent of ®{% (73; a)}. As every measurable functional may 
be approached in the Z sense by such a functional, we see at once that §(3; =) 
is metrically transitive.® 


° Except that the method of treatment has been adapted to the needs of §7, the 
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7. Pure multidimensional chaos. In order to avoid notational com- 
plexity, we shall not treat the general multidimensional case explicitly, but 
shall treat the two-dimensional case by a method which will go over directly 
to the most general multidimensional case. If our initial sets S, are the 
rectangles with terminating binary codrdinates for their corners and sides 
parallel to the axes, and we replace (a,b) by the square with opposite vertices 
(p,q) and (p+r, ¢-+7r), an argument of exactly the same sort as that 
which we have used in the last paragraph will show that except in a set of 
cases of total probability not exceeding 


CO WW 
(69) const. S) & 2424 o (exp (— 24D) ) — 0 (24 exp(— 21)*) ) 

k=1 J=1 
the sum of | §(Sx;%)| for a denumerable set of binary rectangles with base 
<= 2“, of the form (62), and adding up to make a vertical interval lying in 
the square (p,q), (p-+7,¢-+7), must be equal to or less than 


(65) 25 O-(M+k) (B-€) O (2-6) ) | 
=1 

If we now add this expression up for all the base intervals of type (62) neces- 
sary to exhaust a horizontal interval of magnitude not exceeding 2, we shall 
again obtain an expression of the form (65). It hence follows that if we take 
the total mass on the codrdinate rectangles within a given square, this will 
tend to zero uniformly with their area, except for a set of values of a of 
arbitrarily small measure. From this point the two-dimensional argument, 
and indeed the general multidimensional argument, follows exactly the same 
lines as the one-dimensional argument. It is only necessary to note that if 


dn >a, bn > b, On > 6, dnd 


then the rectangles (dm, bm), (Cm, dm) and (Gn, bn), (Cn, dn) differ at most by 
four rectangles of small area.’ 

From this point on, we shall write ® (S;«) for a pure chaos, whether in 
one or in more dimensions. 


8. Phase averages in a pure chaos. If f(P) is a measurable step- 
function, the definition of 


(70) JS #P)adeP (Ss) 


results of this section have previously been demonstrated by the author. (Proceedings 
of the London Mathematical Society, 2, vol. 22 (1924), pp. 454-467). 
7 Here we represent a rectangle by giving two opposite corners. 
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is obvious, for it reduces to the finite sum 


N 
(71) X fnP (Sn; a) 


where fx are the N values assumed by f(P), and S, respectively are the sets 
over which these values are assumed. Let us notice that 


(72) fo dal fi F(P)dr® (95a) [? = SZ defnfaP (Sms 2) (Ss 5 2) 


m=1 n=1e 


= Stal ( (P (8n5.2)) da 


| fy? 4” wep (— es) a 
lO UW CX So ee U 
1 V/ 2am (Sn) A -co P 2m (Sn) 
N 1 oo 
— n |"™m hoe ane f ure (#72) dy 
S| fs [2m (Sn) af 
N 
2 
1 


| fa |?m (Su) = f | F(P)PaVe, 


the integral being taken over the whole of space. In other words, the trans- 


formation from f(P) as a function of P, to f f(P)dpP (S; a) as a function 
of a, retains distance in Hilbert space.* Such a transformation, by virtue of 
the Riesz-Fischer theorem, may always be extended by making limits in the 
mean correspond to limits in the mean. Thus both in the one-dimensional 
and in the many-dimensional case, we may define 


(78) f FP)deP (83a) = Lim. f fa(P)de? (S32) 


where f(P) is a function belonging to L?, and the sequence f,(P),fe(P),° °° 
is a sequence of step-functions converging in the mean to f(P) over the whole 
of space. The definition will be unambiguous, except for a set of values of « 
of zero measure. 

If S is any measurable set, we have 


1 1 oOo a” 
(74) f da{P(S;a)} -leas J exp (— aay) 
= (m(S))"” eS. ure (w/?) dy 


—0 if n is odd 
= (m(8) )”/?(n—1)(n— 38): --1 if n is even. 


8 Cf. Paley, Wiener, and Zygmund, Mathematische Zeitschrift, vol. 37 (1933), 
pp. 647-668. 
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This represents (m(S))"/?, multiplied by the number of distinct ways of 
representing ” objects as a set of pairs. Remembering that if 8,,S2,- ° +, Son 
are non-overlapping, their distributions are independent, we see that if the 
sets 31, S2,° °°, Sen are either totally non-overlapping, or else such that when 
two overlap, they coincide, we have ° 


(75) f Pasa) ns P (Sy3a)da— SI fF (3); a) ® (Se a) da, 


where the product sign indicates that the 2n terms are divided into n sets of 
pairs, 7 and &, and that these factors are multiplied together, while the addi- 
tion is over all the partitions of 1,- - -,2n into pairs. If 2n is replaced by 
en + 1, the integral in (75) of course vanishes. 

Since P(S;«a) is a linear functional of sets of points, and since both 
sides of (75) are linear with respect to each P (33a) separately, (75) still 
holds when 3, 32,- - -, Sen can be reduced to sums of sets which either coin- 
cide or do not overlap, and hence holds for all measurable sets. 

Now let f(P:,° - -, Pn) be a measurable step-function: that is, a function 
taking only a finite set of finite values, each over a set of values P,,-- +, Pn 
which is a product-set of measurable sets in each variable Py. Clearly we 
may define 


(16) fies fue + +, Pa)de,® (85a) «+ de, P(S32) 


in a way quite analogous to that in which we have defined (70), and we shall 
have 


(77) fo a  f F(Pas + +, Ped? (85 a) dp, P (S32) 
—3f 25 Sf f(Psy Pa Px Pa i PoP Vd a Ped Ve 


where the summation is carried out for all possible divisions of the 2n P’s 
into pairs. Similarly in the odd case 


(78) fia ff ; f fle. > +, Pons) dp,P (S3a)- > - dp, P(S3a) =—0. 


We may apply (77) to give a meaning to 


(79) J. aif . f f(P1,: + +, Pn)dp,P(S; a) > - -dp,P(S3 a) |2. 


If f(P:,: --+,Pn) is a measurable step-function, and 








° Cf. Paley, Wiener, and Zygmund, loc. cit., formula (2.05). 
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(80) |f(Pi---+,Pr)j< | fi(Ps) EPA 4 
fl) PaveS 4 (b= 1,2,--9) 


we shall have 


1 
(81) fo dal ff f(Ps- + Pa)deP (83a) > - - de® (83.4) 
0 
S A"(2n — 1) (2n — 3)-+- +1. 
If now f(Pi,- > -,Pn) is an integrable function satisfying (80), but not 


necessarily a step-function, let 


(82) fv; Pay-+, Pn) == sgn f(P.,**, Pu) [of (Piss Pa) sem f(Pay 0 Pa) 


Vv 


Clearly almost everywhere 


(83) f- £4 sts +, Py) dp,? (S32) «+ -dp,P (S32) 
<I ff fe(P)de? (83) 


and 

84) Lm eens ;P.,- + +,Pn)dp,P (8S; a) +: -dp,P(S; 

( ) Tm | f f tus )dp ( a) e ( #) 
—f. f f(v; Pa, + +, Pn) dp,P (S; «) dp,P? (S; a) | 


; » | f,, fe(P) de® (83 a) 
SII f fu(P)de® (8; ) e+ 35 —-—_____ 
1 s f fu(P)dpP (8; a) 


where & represents the exterior of a sphere of arbitrarily large volume. Let 
it be noted that both the numerator and the denominator of this fraction have 
Gaussian distributions, but that the mean square value of the numerator is 
arbitrarily small. Thus except for a set of values of « of arbitrarily small 
measure, the right side of expression (84) is arbitrarily small, so that we may 
write 


(85) lim | ff fusPu + * Pa) de® (85a) + + -de,P (932) 


Ly V-> 00 
ae i -~ f fsP.. - +, Pn)dp,P (S3a)- - ‘dp, P (S30) | = 0, 


Thus by dominated convergence, 


(86) lim f-- ~~ f f(us Pays +. Px) dr, ® (83) - + - de® (83 @) 
UO e« e 
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exists for almost all values of «, and we may write it by definition 


(sv) fs ( F(Pu > +, Pa)de,P (83.4) > + de, P (S52), 


This will clearly be unique, except for a set of values of a of zero measure. 
There will then be no difficulty in checking (77), (78), and (81). 


9. Forms of chaos derivable from a pure chaos. Let us assume that 
f(P) belongs to L*, or that f(P1,: > +, Pn) is a measurable function satisfying 
—~ 


(80). Let us write PQ for the vector in n-space connecting the points P and 
Q. Then the function 


(88) f- . f f(PP.,: >, PPx)dp,P? (S30) + > dp,P(S3a) = F(P; a) 


is a metrically transitive differentiable chaos. This results from the fact that 
@(S;a) is a metrically transitive chaos, and that a translation of P gen- 
erates a similar translation of all the points P,. The sum of a finite number 
of functions of the type (88) is also a metrically transitive differentiable chaos. 
To show that F(P;a) is measurable in P and @ simultaneously, we merely 
repeat the argument of (83)—(86) with both P and «@ as variables. 

We shall call a chaos such as (88) a polynomial chaos homogeneously of 
the n-th degree, and a sum of such chaoses a polynomial chaos of the degree 
of its highest term. In this connection, we shall treat a constant as a chaos 
homogeneously of degree zero. 

By the multidimensional ergodic theorem, if ® is a functional such that 


(89) {| 8@Rs4)) | lost | BFP 9))| da < 
we shall have 
(90) limos Sf, 2(F(P;4))dVr—= f o(F(P32)) da 


for almost all values of «. Since the distribution of ’(P; a) is dominated by 
the product of a finite number of independent Gaussian distributions, we 
even have 


(91) cs | F(P;a)|"da < a 


for all positive integral values of n. In a wide class of cases this enables us 
to establish a relation of the type of (89). 

In formula (90), we have an algorithm for the computation of the right- 
hand side. For example, if 
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(92) B(P 32) =f f(PP,)dp,9 (S54), 


and P+ @ is the vector sum of P and Q, we have for almost all a, 


(98) lim pow f F(P+Q3)F(Q;2)aVo— f FP + O)F(Q)AVo, 


the integral being taken over the whole of space; if 


(04) F(P3a) = f+ +» f f(BP,, PP.) de,P (S5 a) de,® (8; 2), 
we have almost always 
(95) lime f P(P + 3a) (Q; 2)aVe 
=| f 1(Q,Qavelr+ ff E+QP+M)F(Q M)avedVa 


+f f fP+OP+ UW) F(M, Q)dVedVus 


and if 
(96) F(P3a)— fff f(PP., PP. PP.) dv,P (83 a) de,P (S;2)dr,? (S52), 


we have almost everywhere 


(97) lim 5 DG f, F(P+Q;a)F(Q;a)dVo 


=S Sf 100.749 700,8,5 


+ #(Q,9,P + M)f(S, M, 8) + 7(Q,9,P + M)f(S, 8, UM) 
+ #(Q,P +M, Q)f(M, S,8) + 1(Q,P + MU, Q)F(S, M, 8) 
+ #(Q,P + M, Q)F(S,8,M) + 7(P + M, Q, Q)F UL S, 8) 
+ f(P+M, Q,Q)F(S, M, 8) + f(P+ M, Q, Q)F(S, 8, M) 
4 f(PLOP+M,P+8)7(Q, M8) 
+f(P+Q,P+M,P+8)f(9,8,M) 
4 F(P+Q,P+M,P+8)F(M, 9,8) 
4 f(P+QP+M,P +8) FM, S, Q) 
4 f(P4Q,P4+M,P+S)F(S, 0,1) 


We have similar results in the non-homogeneous case. Thus if 
(98) F(P:a) =A + f f(PP;) dp,? (S83) 
+ ff g(PP, PP.)de, (8; 2) de,? (834), 
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we have almost everywhere 


(99) impo f F(P + 03.0) F (Qs) aV< 
=A f 9(0,0)dVo+4 § 9(0,0)aVo 
+ fFP+QOFOQ)AVe +1 J (0, 0)aVo I 
+ f [P+ OP + M)9(Q, M)aVodVn 
+ f (9 P+OP +1) GM, Q)dV 0d x 


10. Chaos theory and spectra.*° The function 
(100) eee f F(P + Q)F(Q)dVo = G(P) 
TOO V(r) R 


occupies a central position in the theory of harmonic analysis. If it exists 
and is continuous for every value of P, the function F'(P) is said to have an 
n-dimensional spectrum. To define this spectrum, we put 


F(P) on &; 


on) sae 0 elsewhere. 


It is then easy to show by an argument involving considerations like those of 
(49) that if 
1 a 
(102) GP) ay [PAP + Q)BH(Q)aVe, 
V(r) Joo 
the integral being taken over the whole of space, then we have 
(103) G(P) =lm G;,(P). 
roo 
Since, if O is the point with zero codrdinates, by the Schwarz inequality, 
(104) | G(P)| = @(0), 
the limit in (103) is approached boundedly. 
If now we put 

(105) r(U) = (20) V (r)2L im, f F,(P)eU-PdV p 

8-00 S 


where S is the interior of a sphere of radius s about the origin, the n-fold 
Parseval theorem will give us 


10 Cf. N. Wiener, “ Generalized harmonic analysis,” Acta Mathematica, vol. 55 
(1930). 
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(106) (oe (an) G.(P) eU-PdVp. 


If M(U) is a function with an absolutely integrable Fourier transform, we 
shall have 


(107) f | &(U) [> M(U)dVy— ny" f G(P)dV» f M(W)e dV 
and hence 

(108) Jim | e(U) 7 M(O)aVo—= (2x) G(P)dVr f M(U)ePaVy 
which will always exist. Let us put 

(109) MEM (U)} = (20) f _O(P) Vp J MOePaVo. 


If § is any set of points of finite measure, and S(P) is its characteristic 
function, let us put 


(110) H(S)= Lub. M(M(U)), 
M(U)2S(U) 
and 
(111) N(S) = g.lb. M(M(U)). 
= M(U)=S(U) 


If %(S) and (8) have the same value, we shall write it 9/(8), and shall 
call it the spectral mass of F on S. It will be a non-negative additive set- 
function of S, and may be regarded as determining the spectrum of fF. 

If f(P:,° - +,Pn) satisfies (80) and F(P;«a) is defined as in (88), we 
know that for any given P, 


(112) G(P3a)— lima f F(P+Q3a)F(Qia)— fF(P,8)P(0,8) dp 


oo V(r) 


for almost all values of « This alone is not enough to assure that F'(P, «) 
has a spectrum for almost all values of a, as the sum of a non-denumerable set 
of sets of zero measure is not necessarily of zero measure. On the other hand, 
except for a set of values of a of zero measure, ((P, x) exists for all points P 
with rational codrdinates. 

We may even extend this result, and assert that if 


(113) Py(P3 2) — Fey fj F(S;4)dV5 


length of PS<6 
and 


(114) @9(P32) = lim = wads Fy (P+ Q3a)Fg(Q; a) dV, 
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then except for a set of values of « of zero measure, Gg(P;«) exists for all 
points P with rational coordinates and all rational parameters 6, and it is 


easily proved that for almost all values of «, as @ tends to 0 through rational 
values, 


(115) lim lim 7 35 J, | Py(Q;.a) —F(Q;a)|? dVo=0. 


Now, by the Schwarz inequality, 
(116) | pay f Fo(P + Q3.9)Fo(Q;.0)aVe 
va JS, Bg(P:1 + 93%) F4(Q; %)dVoq | 
: 1 ot, ae ; 2 ; 
<{ 60054) (ey | Fo(P +054) —Fo(P: + 054)/°aVe) | 


= { See aS. are va L Sinetnae Sma Besa PEO er eve) 
od Or LS nas Si<o “J oe avs) 


< G4 (0; a)0(| PP, [A). 


It thus follows that if (114) exists for a given @ and all P’s with rational 


codrdinates, it exists for that 6 and all real P’s whatever. We may readily 
show that 


(117) Gj(O;a) < G(0;2). 





By another use of the Schwarz inequality, 
(1s) | pay f Fo(P + 034) Fo(Qs2)dVo 
1 e 7 e 
— FJ, FP + OF (Osa) aVe 
Sai f, | Fo(P +050) —F(P+Q54)| | Fo(Qs 9) aVe 
1 . . eat ° 
+ Hay Jp | PP +52)! | Fo(Qs a) —F(Q32)| dVeo 
<(G(054) Fay [| Fo(P+ 05a) —F(P + Q5.a)/* aVo} 
1 i ° ° 2 3 
+ (4(05.2) ary f | Po(Qsa) — F(Qs.a)l* aVo}3 


Combining (115) and (118), we see that except for a set of values of @ of zero 
measure, we have for all P, 


(119) G(P; «) — lim Go(P;«@). 
0 
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We thus have an adequate basis for spectrum theory. This will extend, not 
merely to functions F'(P,a) defined as in (88), but to finite sums of such 
functions. It will even extend to the case of any differentiable chaos F(P;«), 
for which F(P + Q;a)F(Q;«) is an integrable function of «, and for which 
(115) holds. For a metrically transitive chaos, this latter will be true if 


1 
(120) lim f | Fy(P; a) — F(P3a)|? da =. 


Under this assumption, we have proved that #(P) has a spectrum, and the 
same spectrum, for all values of a. 

This enables us to answer a question which has been put several times, 
as to whether there is any relation between the spectrum of a chaos and the 
distribution of its values. There is no unique relation of the sort. The 
function 


(121) f (P+ 2)9(Q) AV 


where g belongs to L?, may be so chosen as to represent any Fourier transform 
of a positive function of DL, and if f(P,Q, M) is a bounded step-function, the 
right-hand side of (97) will clearly be the Fourier transform of a positive 
function of Z. In particular, let f(P1, P2, Ps) =f(Pi)f(P2)f (Ps), 


(122) B,(P3a) = { f(PPs)dp,? (S54) 
and choose f,(@) in such a way that 


(123) f f.(P + Q)f:(Q)dVo =tight-hand side of (97). 
Then 


(124) f (F,(P3a))da—(f" |fi(P) ap )»(2n—1) (2n—8) ro 
and if F(P, a) is defined as in (96), 

(125) in (F(P32))*"da=(f |f(P)|*dV )'™(6n—1) (6n—3) 1) 
so that for all but at most one value of n, 

(126) f (F,(P; a) ?"da an (F(P; «)) da 


and we obtain in # and F, two chaoses with identical spectra but different 
distribution functions. On the other hand, if 


(127) f IAC) aVe— | fa(P)/? aVr, 
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the chaoses 


(128) f f,(PP:) dp,? (S; a) 
and 
(129) f fe(PP.)ar,9 (S; a) 


will have the same distribution functions, but may have very different spectra. 


11. The discrete chaos.11 Let us now divide the whole of Euclidean 
n-space dichotomously into sets Sinn, such that every two sets Sin, n, aDd Sing,ng 
have the same measure, and that each Sn is made up of exactly two non- 
overlapping sets Smii,z. Let us divide all these sets into two categories, 
“occupied,” and “empty.” Let us require that the probability that a set be 
empty depend only on its measure, and that the probability that two non- 
overlapping sets be empty be the product of the probabilities that each be 
empty. Let us assume that both empty and occupied sets exist. Let every 
set contained in an empty set be empty, while if a set be occupied, let at least 
one-half always be occupied. We thus get an infinite class of schedules of 
emptiness and occupiedness, and methods analogous to those of paragraph 6 
may be used to map the class of these schedules in an almost everywhere one- 
one way on the line (0,1) of the variable «, in such a way that the set of 
schedules for which a given finite number of regions are empty or occupied 
will have a probability equal to the measure of the corresponding set of 
values of a. 

By the independence assumption, the probability that a given set Sin,n be 
empty must be of the form e-4”" Sm). Tf Simm is divided into the 2” intervals 
Smav,ny’ °° Smav,nev at the v-th stage of sub-division, the probability that just 
one is occupied and the rest are empty is 


(130) 2”(1 — exp(— Am(Sm,n) /2”) exp (— al a : Am (Snr) ; 





This contingency at the v + 1-st stage is a sub-case of this contingency at the 
v-th stage. If we interpret probability to mean the same thing as the measure 
of the corresponding set of @’s, then by monotone convergence, the probability 
that at every stage, all but one of the subdivisions of Sm,» are empty, while the 
remaining one is occupied, will be the limit of (180), or 


(131) Am (Smm,n)exp(— Am(Smn) ). 








11'The ideas of this paragraph are related to discussions the author has had with 
Professor von Neumann, and the main theorem is equivalent to one enunciated by the 
latter. 
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Such a series of stages of subdivision will have as its occupied regions exactly 
those which contain a given point. 

The probability that the occupied regions are exactly those which contain 
two points is the probability that each half of Sm,n contain exactly one point, 
plus the probability that one-half is empty, and that in the occupied half, each 
quarter will contain exactly one point, plus and so on. This will be 


in) Am (Sme) = < ~~) \* 


4 
ED 9 exp (An) Sa exp (— eS) 
+ ee ee EXD (— Am(Sm,n)™m(Sm,n) A -- 4 ss = _) 
= Fm Ene) ) exp (— Am(Simn))- 


If the probability that the occupied regions are exactly those containing / — 1 
points is 
(Am (Sinn) )™ 


eye ee) 


then a similar argument will show that the probability that the occupied 
regions are exactly those containing & points will be 


m1 ; ee 
(183) 2 7! Cy (Am (Sinn) ) exp (— Am (Sin,n) ) (: T oR oc 4k-1 se ) 


a ae | l ; (Am (Sinn) )* exp (— AM(Smn,n) ) 





a Ska 


= a (Am (Sinn) ye exp (= Am (S8mn,n) ) : 


Thus by mathematical induction, the probability that the occupied regions 
are exactly those containing / points will be 


oy (Am (Sn) )* exp (— Am(Smn)) 
and the sum of this for all values of k will be 
(134) SE (Am (Smn))® exp (— Am(Smn)) = 1. 
In other words, except for a set of contingencies of probability zero, the 


occupied regions will be exactly those containing a given finite number of 
points. 
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We may proceed at once from the fact that the probability that a set 8, 
contains exactly k points is 


: (Am(8;) \" e-Am (Sj) 


while the probability that the non-overlapping set S. contains exactly k 
points is 


1 
k! (Am (S82) )* e-Am (Se) 


to the fact that the probability that the set S,-+ S82 contains exactly k 
points is 


(135) S22 


] ! (k—j)! (Am (S;) )i (Am (S82) ) Je Am (S482) 


= Fy (Am (8; + So) )Feram Sess, 


From this, by monotone convergence, it follows at once that the probability 
that any set S which is the sum of a denumerable set of our fundamental 
regions Sim,n Should contain exactly k points is 


1 
al (Am(8) )ke-Am(S) , 


It is then easy to prove this for all measurable sets V. 

We are now in a position to prove that the additive functional D(S; a), 
consisting in the number of points in the region 8 on the basis of the schedule 
corresponding to a, is a homogeneous metrically transitive chaos. The réle 
which continuity filled in paragraphs 6 and 7, of allowing us to show that 
Ou, oe. y,(S 3%) was measurable in y1,° - >, Yn and @, is now filled by the fact 
that the probability that any of the points in a region lie within a very small 
distance of the boundary, is for any Jordan region the probability that a small 
region be occupied, and is small. The metric transitivity of the chaos results 
as before from the independence of the distribution in non-overlapping regions. 

The discrete or Poisson chaos which we have thus defined is the chaos of 
an infinite random shot pattern, or the chaos of the gas molecules in a perfect 
gas in statistical equilibrium according to the old Maxwell statistical me- 
chanics. It also has important applications to the study of polycrystalline 
aggregates, and to similar physical problems. 

Two important formulae are 


(136) f, D(Ssa)da— eames Ee (Am(S))*& = Am(8), 
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and 

(137) f(D (85.2) )%da— e-sm S IE (4m(S8))* = (Am(8))* + Am(8). 
Let it be noted that if we define 

(138) f f(P)dpD (S; a) 

for a measurable step-function f(P) as in (70), by 

(139) > fuD (Sn5.2), 

(72) 1s replaced by 

(140) fda | f° f(P)drD (85a) —A Ff F(Q)aVe | 


N N 1 ss 
=> > da fnfnD (Sm; %)D (Sn5%) 


—2R {AE [da fnD (Sus) { FQ)AVe+ 14S F(Q)dVe 
— 3 | fn |? Am(Sn) 
—Af | f(P)IPaVr, 

Thus the transformation from f(P) as a function of P, to 


(141) J f(P)deD (S32) Af F(Q)aV< 


as a function of a, retains distance in Hilbert space, apart from a constant 
factor, and if f(P) belongs to LZ and _L? simultaneously, and {fn(P)} is a 
sequence of step-functions converging in the mean both in the LZ sense and in 
the L? sense to f(P), we may define 


(142) { f(P)deD (830) = 4 f H(Q)AV< 
+liiem. (f fa(P)deD(S5a)—A f fa(Q)AVe). 


As in the case of (73), this definition is substantially unique. We may prove 
the analogue of (93) in exactly the same way as (938) itself, and shall obtain 


(143) tim af) f+ Oils a)duD (83a) —4 f° FaN)aVw 
x} fF Girs «)dv® (854) —A FAN Avy | aVe 


— A? f fe + Q)F(Q)dVo. 
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As we may see by appealing to the theory of spectra, one interpretation of this 
in the one-dimensional case is the following: /f a linear resonator be set into 
motion by a haphazard series of impulses forming a Poisson chaos, the effect, 
apart from that of a constant uniform stream of impulses, will have the same 


power spectrum as the energy spectrum of the response of the resonator to a 
single impulse. 


12. The weak approximation theorem for the polynomial chaos. We 
wish to show that the chaoses of paragraph 9 are in some sense everywhere 
dense in the class of all metrically transitive homogeneous chaoses. We shall 
show that if %(S;a) is any homogeneous chaos in n dimensions, there is a 
sequence (8S; ) of polynomial chaoses as defined in paragraph 9, such that 
if S,,- - -,S8v 1s any finite assemblage of bounded measurable sets in n-space 
selected from among a denumerable set, and 


(144) f. | (Sy; 4%) |4#da< x (Ase 125 ey) 


is finite, then 


(145) [ S(S.54). §(Sv5.a)da— lim f” B(8154)- + + Bn(Sv; a) da. 


We first make use of the fact that if the probability that a quantity u be 
greater in absolute value than A, be less than 





2 ee 
(146) — | gw 28) ats 
V 2rB A 
then if w(w) is any even measurable function bounded over (— ©, 0), we 
may find a polynomial ¥.(u), such that the mean value of 


(147) | p(w) — Ye(u) |", 
which will be 

1 
V 2rB J- 





(148) | aia) — Ye(a) | e/a, 


is less than «. Since it is well known that if ¢(w) is a continuous function 
vanishing outside a finite interval, and 


0O 
(149) ¥ AnHn(u)e@) 


1 
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is the series for ¢(w) in Hermite functions, then we have uniformly 
. CO 
(150) o(u) = lim 3 Ant"An(uj)e?”), 
t>1-0 1 
to establish the existence of y.(u), we need only prove it in the case in which 


(151) W(u) = uke Cw 


for an arbitrarily small value of C’: as for example for C = 1/4n,B. We shall 
then have 











2) i 
(152) yw) 3 2 = | uU peewee. 
so that by dominated convergence, and if we take N large enough, we may 
make 
(153) =f lvw) rertwretg = 
——— U) —we(u) |\"e “PAI du < msn). 
eg SV) He) © (nS) 
Now let 


0 l<K); 
(154) WK (uU) -| 1 : or 


and let us put 

(155) 8 (P32) => r) 
The chaos 

(156) L(P3 4) = dx(S(P;2)) 


P(S;a) (S = interior of OP |<"). 


may then be approximated by polynomial chaoses in such a way as to approxi- 
mate simultaneously to all polynomials in &(P;«) by corresponding poly- 
nomials in the approximating chaoses. Since the distribution of the values 
of &§(P;a) will be Gaussian, with a root mean square value proportional to a 
power of 7, and &(P;«) will be independent in spheres of radius y about two 
points P, and P, more remote from each other than 27 + 2y, it follows that 
if we take K to be large enough, we may make the probability that £(P; «) 
differs from 0 between two spheres of radii respectively 7 -- y and H about a 
given point where it differs from 0, as small as we wish. 
We now form the new chaos 


|PQ|<a# 


which we may also approximate, with all its polynomial functionals, by a 
sequence of polynomial chaoses. The use of polynomial approximations 
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tending boundedly to a step function over a finite range will show us that 
this is also true of the chaos determined by 


(158) bo f ah (Q54)dVo—= M(P3 a). 


By a proper choice of the parameters, this can be made to have arbitrarily 
nearly all its mass uniformly distributed over regions arbitrarily near to 
arbitrarily small spheres, all arbitrarily remote from one another, except in 
an arbitrarily small fraction of the cases. We then form 





| NEO i pa: 
(199) te f_ (Qa) +8) exp (— SEE) avg — 1 (Pi) 
where 6 is taken to be very small. This chaos again, as far as all its poly- 
nomial functionals are concerned, will be approximable by polynomial chaoses. 
Since it is bounded away from 0 and o, and since over such a range the 


function 1/z may be approximated uniformly by polynomials, it follows that 
in our sense, 


(160) 1/N (P; «) 


is approximable by polynomial chaoses. 
If w(P) is any measurable function for which arbitrarily high moments 
are always finite, it is easy to show that 


| PQ 
2k 








(161) ote f_7(Q) (Osa) +8) exp(— EET) ave — (Ps) 


is approximable by polynomial chaoses. Multiplying expressions (160) and 
(161), it follows that 


(162) W(P3a)/N(P3 a) = UP; «) 


is approximable by polynomial chaoses. 
If A is a large enough constant, depending on the choice of the constant e, 


we have 
dV 
(2a 3 (22k)? S22? »(—- i) J 
{2 oeailag Be anon (#?/2h) gp 


1 ~ Aare) 
< (aakyie SP a 





(163) 
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Thus by the proper choice of the parameters of 9n(P;«), if we take k small 
enough and then 6 small enough, the chaos (162) will consist as nearly as we 
wish, from the distribution standpoint, of an infinite assemblage of convex 
cells of great minimum dimension, in each of which the function w(P) is 
repeated, with the origin moved to some point remote from the boundary. 

Now let 3(S;a) be a metrically transitive homogeneous chaos. Let 
us form 


(164) (83a) Tf Bans an(S5 a) day” - day 


Clearly by the fundamental theorem of the calculus, over any finite region in 
(%1,° * *,%n), we shall have for almost all points and almost all values of a, 


(165) o(S;a%) = lim (7; 83 a); 


aud if (144) holds, it is easy to show that 


1 
(166) J | (73 83a) !"da < const. 
0 


From this it follows that 
1 

(167) lim f lo(r73 83a) —B(S; a) |"da = 0 
rTr—0 0 


and by the ergodic theorem, except for a set of values of a of zero measure, 
as 7 tends to 0 through a denumerable set of values, 


1 1 a e 
(168) lim aay f | Bes... an (75 53 %) 
ya, p58 OS Pr eee a) |"da, mee Ax» =— 0. 


With this result as an aid, enabling us to show that the distribution of G(S; «) 
is only slightly affected by averaging within a small sphere with a given radius, 
or even within any small region near enough to a small sphere with a given 
radius, we may proceed as in (161) and (162) and form the chaos 


(169) Be(85@) — ar raray capeyTe J, Baw 20(95 8) 


(—— 3) 2;?) 
x (M(a1,° + +, 23%) +8) ee a dt,: + + dt. 
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For almost all 8, in each of the large cells of this chaos, (169) will have as 
nearly as we wish the same distribution as some %z, 2,(93;%), where 


(21,° * *,@n) lies in the interior of the cell, remote from the boundary. These 


x, ee we yg 


cells may so be determined that except for those filling an arbitrarily small 
proportion of space, all are convex regions with a minimum dimension greater 
than some given quantity. 

To establish (143), it only remains to show that the average of a quantity 
depending on a chaos over a large cell tends to the same limit as its average 
over a large sphere. To show this, we only need to duplicate the argument 
of paragraph 4, where we prove the multidimensional ergodic theorem, for 
large pyramids with the origin as a corner, instead of for large spheres about 
the origin. We may take the shapes and orientations of these pyramids to 
form a denumerable assemblage, from which we may pick a finite assemblage 
which will allow us to approach as closely as we want to any cell for which 
the ratio of the maximum to the minimum distance from the origin within 
it does not exceed a given amount. It is possible to show that by discarding 
cells whose measure is an arbitrarily small fraction of the measure of all space, 
the remaining cells will have this property. 


13. The physical problem. The transformation of a chaos. The 
statistical theory of a homogeneous medium, such as a gas or liquid, or a field 
of turbulence, deals with the problem, given the statistical configuration and 
velocity distribution of the medium at a given initial time, and the dynamical 
laws to which it is subject, to determine the configuration at any future time, 
with respect to its statistical parameters. This of course is not a problem in 
the first instance of the history of the individual system, but of the entire 
ensemble, although in proper cases it is possible to show that almost all sys- 
tems of the ensemble do actually share the same history, as far as certain 
specified statistical parameters are concerned. 

The dynamical transformations of a homogeneous system have the very 
important properties, that they are independent of any choice of origin in 
time or in space. Leaving the time variable out of it, for the moment, the 
simplest space transformations of a homogeneous chaos 3(S;«) which have 
this property are the polynomial transformations which turn it into 


Ko+f Ky (1 — Yr, 2 — Yay» Ln — Yn) OY eee m(S 3%) dyi*+* dyn 
(170) Sa 
ah fo Kom O50 tH, NM 


tn — Yn) Buca) cate yeti D 1@) sant 
Oy”) ee ey Yn(V) ‘Go a) dy’? ae dyn™. 
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These are a sub-class of the general class of polynomial transformations 


Ko + f Kila, Peas Yay yin) Wyse ym (S35 %)dyi°** dyn 
Gv) + 


+f a fe CORRE SS aaa aa Ug et ea) 
xX On, ..., un(1y (S35 %) *° Wu), ..., univ (S 5%) dyi™ +++ dyn™. 


If a transformation of type (171) is invariant with respect to position in space, 
it must belong to class (170). On the other hand, in space of a finite number 
of dimensions and in any of the ordinary spaces of an infinite number of 
dimensions, polynomials are a closed set of functions, and hence every trans- 
formation may be approximated by a transformation of type (171). 

A polynomial transformation such as (170) of a polynomial chaos yields 
a polynomial chaos. If then we can approximate to the state of a dynamical 
system at time 0 by a polynomial chaos, and approximate to the transformation 
which yields its status at time ¢ by a polynomial transformation, we shall 
obtain for its state at time t, the approximation of another polynomial chaos. 
The theory of approximation developed in the last section will enable us to 
show this. 

On the other hand, the transformation of a dynamical system induced 
by its own development is infinitely subdivisible in the time, and except In 
the case of linear transformations, this is not a property of polynomial trans- 
formations. Furthermore, when these transformations are non-linear, they 
are quite commonly not infinitely continuable in time. For example, let us 
consider the differential equation 


This corresponds to the history of a space-distribution of velocity transferred 
by particles moving with that velocity. Its solutions are determined by the 
equation 

(173) u(a,t) = u(e —tu(z,t), 0), 


or if w is the inverse function of u(z, 0), 
(174) a —tu(2,t) = y(u(s,t)). 


Manifestly, if two particles with different velocities are allowed to move long 
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enough to allow their space-time paths to cross, u(w,¢) will cease to exist as a 
single-valued function. ‘This will always be the case for some value of ¢ if 
u(x,0) is not constant, and for almost all values of ¢ and a if it is a poly- 
nomial chaos. 

By Lagrange’s formula, (174) may be inverted into 


(175) u(x,t) = SP" ((u(a, 0)". 





In a somewhat generalized sense, the partial sums of this formally represent 
polynomial transformations of the initial conditions. However, it is only for 
a very special sort of bounded initial function, and for a finite value of the 
time, that they converge. It is only in this restricted sense that the poly- 
nomial transformation represents a true approximation to that given by the 
differential equation. 

It will be seen that the useful application of the theory of chaos to the 
study of particular dynamical chaoses involves a very careful study of the 
existence theories of the particular problems. In many cases, such as that of 
turbulence, the demands of chaos theory go considerably beyond the best 
knowledge of the present day. The difficulty is often both mathematical and 
physical. The mathematical theory may lead inevitably to a catastrophe 
beyond which there is no continuation, either because it is not the adequate 
presentation of the physical facts; or because after the catastrophe the physical 
system continues to develop in a manner not adequately provided for in a 
mathematical formulation which is adequate up to the occurrence of the 
catastrophe; or lastly, because the catastrophe does really occur physically, and 
the system really has no subsequent history. The hydrodynamical investi- 
gations required in the case of turbulence are directly in the spirit of the work 
of Oseen and Leray, but must be carried much further. 

The study of the history of a mechanical chaos will then proceed as 
follows: we first determine the transformation of the initial conditions gen- 
erated by the dynamics of the ensemble. We then determine under what 
assumptions the initial conditions admit of this transformation for either a 
finite or an infinite interval of time. Then we approximate to the transforma- 
tion for a given range of values of the time by a polynomial transformation. 
Then, having regard to a definition of distance between two functions de- 
termined by the transformation, we approximate to the initial chaos by a 
polynomial chaos. Next we apply the polynomial transformation to the poly- 
nomial chaos, and obtain an approximating polynomial chaos at time t. 
Finally, we apply our algorithm of the pure chaos to determine the averages 
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of the statistical parameters of this chaos, and express these as functions 
of the time. 

The results of such an investigation belong to a little-studied branch of 
statistical mechanics: the statistical mechanics of systems not in equilibrium. 
To study the classical, equilibrium theory of statistical mechanics by the 
methods of chaos theory is not easy. As yet we lack a method of representing 
all forms of homogeneous chaos, which will tell us by inspection when two 
differ merely by an equimeasure transformation of the parameter of distribu- 
tion. In certain cases, in which the equilibrium is stable, the study of the 
history of a system with an arbitrary initial chaos will yield us for large values 
of ¢ an approximation to equilibrium, but this will often fail to be so, particu- 
larly in the case of differentiable chaoses, or the only equilibrium may be that 
in which the chaos reduces to a constant. 
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